THE CHARACTERISTIC POLYNOMIAL AND DETERMINANT 
ARE NOT AD HOC CONSTRUCTIONS 

R. SKIP GARIBALDI 



Most people are first introduced to the characteristic polynomial and determi- 
nant of a matrix in a linear algebra course as undergraduates. The determinant 
is usually defined as an alternating sum of products of entries of the matrix (as 
in Jacobi [Jacobi 18411 §4], [Weberl §23], and |Br01l §6.1]) or as the unique map 
Mn{F) —f F which is multilinear and alternating in the columns and which is 1 on 
the identity matrix (as in Weierstrass |Wei| and the books by Hungerford |Hu 80| : 
Lang [Lang 02| ; and Dummit and Foote). As a student, I thought these definitions 
were at best magical and at worst ad hoc. Where did the determinant come from? 
This paper gives definitions which I hope the reader will find more natural. 

Now, the determinant of a linear transformation on M" is a natural enough 
object: its absolute value gives the factor by which the transformation enlarges 
volumes, and its sign says whether or not the map preserves orientation. These 
properties imply Weierstrass' axioms, see e.g. jHa 96| or |Lax 971 §5]. 

Another good definition of the determinant — not so common at the undergrad- 
uate level — is in terms of the n-th exterior power of A"-F" as in jBo 481 Ch. Ill, 
§6]. This also leads to the Weierstrass axioms. 

But even these two "good" definitions have a taint of being special to matrices. 
(The first is even limited to matrices with real entries.) After all, analogues of 
the determinant are known for the quaternions, the octonions, finite-degree field 
extensions... It is not clear how to adapt the two good definitions to handle these 
algebras. As mathematicians, we should demand a definition that works simulta- 
neously in all cases. Here we give a definition of the characteristic polynomial in ^ 
which works for all of these cases; the constant term of this characteristic polyno- 
mial gives an analogue of the determinant. For n-by-n matrices, we derive Jacobi's 
alternating sum formula for the determinant, see 12.61 2. We also recover the known 
ad hoc formulas for the determinant for quaternions fin 13. II) and finite-degree field 
extensions fin l4.4|) . Moreover, the product formula det(aa') — det(a)det(a') always 
holds, see fjSl 

(One normally begins with a definition for the determinant and then defines the 
characteristic polynomial of a matrix a as det(2; — a). We work in the opposite 
direction here.) 

The philosophy is the following. Consider the lines in given by the equations 

ax + by — c and a'x + b'y — c'. 

If the coefficients a, 6, c, a', 6', c' are specific real numbers, the lines might be 
parallel or the same (degenerate case) , but "typically" they intersect at exactly one 
point. If we treat the coefficients as independent indeterminates, we say that the 
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lines are generic. Such lines intersect at the point 

, . f b'c— c'b ad — a! c\ 
" \ah' -a'h' ah' - a'b J ' 

Similarly, a typical n-by-n matrix has n distinct eigenvalues."'^ For such a matrix, 
the traditional characteristic polynomial is just the minimal polynomial. To define 
the characteristic polynomial of a particular matrix a, we first find the minimal 
polynomial min. po\y.^{x) of a generic matrix 7. Plugging in specific values for the 
indeterminates in 7, we get a polynomial whose only indeterminate is x, and this 
is the characteristic polynomial of a. This method of defining the characteristic 
polynomial works for all finite-dimensional i^-algebras, and the determinant is (up 
to a sign) the constant term of the characteristic polynomial. 

The core of the idea — looking at the minimal polynomial of a generic element — 
goes back to the late 1800s, see [Study 1889| p. 241] and |Scl891[ p. 301]. However, 
all treatments that I have found do not develop the properties of the general charac- 
teristic polynomial (as in |Dic 38j and |De 48 |). or they make use of known properties 
of the characteristic polynomial and determinant for matrices in studying the gen- 
eral characteristic polynomial (as in |A1 611 §1.13], |Jacobson68| . |Jacobson63| . and 
[Jacobson 431 §5.18]). We use only elementary properties of matrices from the very 
nice paper |Ax 95j . 

Readers with an algebraic background may argue that one can obtain the char- 
acteristic polynomial of an n-by-n matrix over an arbitrary field F by applying the 
structure theory for finitely-generated torsion modules over a PID (as is done in 
|He 751 §6.7]). But once one is using that much algebra, the contents of this paper 
are not so far away, and the results here are much stronger. 

1. The characteristic polynomial 

In this section, we define the characteristic polynomial of an element a in a 
finite-dimensional F-algebra A ()1.3|l and give some of its basic properties (|1.10(l . 

Definition 1.1. An F-algebra is a ring A with a multiplicative identity 1 7^ 
such that A is an i<"-vector space and a{ab) — (aa)b — a{ab) for all a G and 
a,b € A. (Alternately, ^ is a ring with identity such that there is a monomorphism 
F — > Z{A) which maps the identity in F to the identity in A.) All algebras that 
we consider will be finite- dimensional as vector spaces over F. 

Principal examples of _F-algebras are the n-hy-n matrices Af„ (F) and Hamilton's 
quaternions, which form an M-algebra f see 13. 111 . 

Note that the definition gives a copy of F inside the center of A, but there is no 
requirement that F is the entire center of A; for example, A may be taken to be a 
finite-degree field extension of F. 

We do require that A is associative, e.g., A cannot be the octonions. However, 
the definition of the characteristic polynomial given below also works for the much 
broader class of strictly power-associative F-algebras, which includes the octonions 

^If you randomly choose an n-hy-n real matrix, the probability that you pick one with distinct 
eigenvalues is 100%. This is because the other real matrices are a set with Lebesgue measure 0. 

Topologically, amongst the n-by-n matrices, those with distinct eigenvalues form a dense open 
subset. Over R or C this is true in the usual topology; it is also true over any infinite field in the 
Zariski topology |.Tacobson 751 §1.9]. 
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and Jordan algebras. Many of the properties of the characteristic polynomial proved 
here can also be proved in that more general setting, see [Jacobson 681 Ch. VI]. 

The only tool we need that is not always discussed in a first-year graduate algebra 
course is the tensor product (g), for which we refer the reader to any good graduate 
algebra text. Heuristically, it allows one to make precise the notion of enlarging 
our base field F: if K is any extension field of F, then M„(F) (E)p K is isomorphic 
to Mn{K) as i^-algebras. For A an F- vector space, dimp A = dimK{A K). 

Let ai, . . . , am be an i^-basis for A. Let R ~ . • . , tm] for ti, . . . ,tm (com- 
muting) indeterminates, and let K be the quotient field of R. We call 7 — J^i ^i^-i G 
A®p K a. generic element. The iiT-span of 1, 7, 7^, ... is a subspace oi A® K, so 
it must be finite-dimensional over K. Hence there is a nonzero monic polynomial 
min. poly.^/;^ in K[x] of smallest degree such that min. poly.^yj^(7) = 0, called the 
minimal polynomial for 7 over K. 

Note that this polynomial is unique: If f{x) and g{x) are monic polynomials of 
minimal degree such that 7(7) = 3(7) = 0, then h(x) — f{x) — g{x) is a polynomial 
of smaller degree such that h^j) = 0. This contradicts the minimality of / and g 
unless h{x) = 0. 

Lemma 1.2. The minimal polynomial min. poly. ^^j^ is in R[x], not just K[x]. 

Proof. Consider the i?-submodules Aj of A (g) i? generated by {1, 7, 7^, . . . , 7-'}. 
They form an ascending chain C A2 C • • • . Since R is noetherian (Hilbert's Basis 
Theorem) and A(E) R is a finitely-generated i?-module, this chain must stabilize. 
That is, 7j+i is in Aj for some j, so 7 satisfies a monic polynomial / in Since 
min. poly.^/^ divides / in K[x] and both are monic, min. poly...j,y^ lies in R[x] by 
Gauss' Lemma. □ 

Definition 1.3. Write a G A with respect to the basis ai, . . . , Om above as a = 
aifli for ai & F. The substitution U 1-^ ai defines a map R[x] F[x]. We call 
the image of min. poly.^/^ in F[x] the characteristic polynomial of a and denote it 
by ch.poly.^^^/^ or simply ch.poly.^. 

Remark 1.4. It is immediate from the definition that 

deg(ch. poly.„^^/j?) < dimp A 

and that deg(ch. poly.^ a/f) the same for all a G A. 

Example 1.5 (Upper-triangular matrices). Let A be the algebra of n-hy-n upper 
triangular matrices over F. Write Eij for the matrix whose only nonzero entry is a 
1 in the (i, j)-position. Fix a basis oi, . . ., am for A over F consisting of -By 's with 
ai = Eii for 1 < i < n. Let 7 be the generic element defined above. 

Let /„ denote the n-by-n identity matrix. For each \ < i < n, the matrix ^ — til^ 
has n—1 pivot columns — equivalently, n—1 leading I's — in its row- reduced form, 
hence it has a nonzero kernel. That is, 7 has an eigenvector in with eigenvalue 
ti. Since the ti are distinct elements of K, these eigenvectors form a basis for K"^ 
and 7 is similar in Mn{K) to the diagonal matrix with diagonal entries ti, t2^ . ■ ., 



A more direct argument would be: R[y] is an i?-submodule of A Cg) -R and yl (g) i? is a finitely- 
generated ij-module. Hence 7 is integral over R IHu 801 Thm. VIII.5.3]. Unfortunately, the typical 
proof of this implication invokes determinants, so we use instead that R is noetherian. 
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tn- The minimal polynomial of 7 is nr=i(^ ~ ^Oi since similar matrices have the 
same minimal polynomials.'^ 

By substitution, an upper triangular matrix b (z A has characteristic polynomial 
ULiix-bii). 

Proposition 1.6. The characteristic polynomial ch. poly.^ j^^p depends only on a, 
A, and F (and not on the choice of basis for A). 

Proof. Suppose that we have another _F-basis bi, . . . ,b„i of A with a corresponding 
generic element e = '^^tibi. We may write bi = ^jdijCij for g an invertible m- 
by-TO matrix in M^iF). Let f : R R he the i^-algebra automorphism defined 

by 

i 

Write a E A in terms of both bases as 
(1.7) ^ = Y1 = 

j i 

We have a diagram 

R[x] F[x] 

'1 II 

R[x] -^^^ F[x] 

with horizontal arrows the substitution maps. Equation (|1.7|) gives that aj = 
12i0i9ij fo'^ ii hence the diagram commutes. 

If we begin with min. poly.^/^ in the upper left, substitution gives ch. poly.„ 
computed with respect to the basis ai,...,am in F[x]. On the other hand, / 
extends in an obvious way to an automorphism oi A^ R such that 

3 j i i j 

Hence /(min. poly. ^/^) = min. poly. g^^. The image of this in F[x] is ch. poly.^, 
computed with respect to the basis bi, . . . ,bm- The commutativity of the diagram 
gives the claim. □ 

Lemma 1.8. Let E be a field containing F and fix a E A. The minimal polynomial 
and characteristic polynomial of a is the same over F and over E. 

The following precise statement of the lemma and its proof are technical and are 
best skipped by the casual reader. 

Proof. More precisely we want to prove: 

min. Voh-a/F = mm. Voh-(a®i)/E 

and 

ch. poly.„ = ch. My-(am)AA'^E)/E ■ 
Since F is a field, a generates a free submodule B of A with basis 1, a, a^, . . ., 
Q^d-i ^ ^YiQ degree of min. poly. Then B ^ E is a free i?-module with the 

■^This argument may appear to be excessively long. It is included here to illustrate that we are 
not making use of determinants. 
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same basis, hence the degree of min. poly.^^j^j^^/^; is > d. Since this polynomial 
divides min. poly.^^j^, they are the same."* 

The F-basis oi, . . ., am of A gives an E'-basis oi (g) 1, . . ., Um (E) 1 oi A(E) E, and 
the generic element constructed from this i?-basis is 7 (g) 1 e E{ti, . . . ,t„i). Since 
the minimal polynomials of j/K and (7 1)/E{ti, . . . ,t„i) arc the same by the 
preceding paragraph, we get 

ch. poly.<,_^/j. = ch.poly.(„gi) 

by substitution. □ 

In general, we write 

(1.9) ch.poly.„(a;) = - ci{a)x''-^ + ■■■ 

••• + (-ir-ic„_i(a)x+(-l)"c„(a). 
The elements ci(a) and c„(a) play the roles of the trace and determinant of a. 

Proposition 1.10. Let A be a finite- dimensional F- algebra. Then: 

(1) (Cayley-Hamilton) ch. poly. ^ (a) = for all a E A. 

(2) If f : A A is a ring automorphism or anti- automorphism of A which 
restricts to be an automorphism of F, then /(ci(a)) = Ci{f{a)) for all a £ A. 

(3) Ci{aa) — a^Ci{a) for a E F and a £ A. 

(4) Ci: A ^ F is F -linear. 

(5) If B is a subalgebra of A and b is in B, then ch. poly.^^^ jp divides ch. poly.^^yp. 
in F[x\ . 

We will observe in 12.31 below that our notion of characteristic polynomial on 
Mn{F) is the same as the usual one. Then Prop. 11.101 contains many results 
that one typically proves in a linear algebra course. For example, (2) gives that 
det(a*) — det(a) for a £ Mn{F) and that similar matrices have the same charac- 
teristic polynomial. 

Proof. (1): Write a = aiOi. Then ch. poly.„(a) is the image of min. poly. ..^/^^ (7) G 
A® R under the substitution map A(E R — > A F — A given by ti 1 — > o^i. Since 
min. poly. (7) = in A R, we have ch. poly.jj(o) = in ^. 

(2): Suppose first that / is an automorphism of A. The diagram 

A(g)R F[x] 

/^Idflj |/ 

A(g>R F[x] 

commutes. If we begin with min. poiy.^/j^- in the upper left, we obtain ch. poly.^j in 
the upper right F[x], and then /(ch.poly.^) in the lower right F[x]. On the other 
hand, we obtain f {min. poly ..^^j^) = min. poly. yj-^-)/^ in the lower left and then 
ch. poly.^(^-) in the lower right. That is, we have the desired equality ch. poly. jj^j) = 
/(ch. poly.„). 



This paragraph may be replaced by the sentence: Field extensions are "faithfully fiat". See 
|AM69I pp. 45, 46] for a definition. 
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The same argument works in the case where / is a ring anti-automorphism, 
except that in the diagram we must replace A ^ R in the lower left corner with 
(g) R where A°p denotes the F-algebra A with multiplication reversed. 

(3) and (4): Suppose first that a is not 0. If we write the minimal polyno- 
mial of j/K as '^"^QCix'^ for Ci G R, then the minimal polynomial of a'j/K is 
Er=oC»a""'^'- Thus 

a" min. poly .^/^ (x) = min. poly-a^/K i^x) in R[x] , 

and 

ch. poly.Q(x) = ch. poly.Q,Q(ax) in F[x]. 

Then we have a'^Ci{a) — a^^'^Ci{aa) for < i < n, which proves (3) for a 7^ 0. 
Since ci: A F is given by a polynomial in the coordinates of a € A with respect 
to some basis oi, . . . , a„i, this polynomial is homogeneous of degree i. This gives 
(4), as well as (3) for a — 0. 

(5): Fix a basis bi, . . ., br of B and extend it to a basis ai, . . ., am of A with 
fli = 6i for 1 < i < r. By analogy, set S ~ F[ti, . . . ,tr], let L denote the quotient 
field of S, and let e be the generic element J2l=i ^i^i in A (g) S*. 

Write h = (iibi for (3i G F. Let </> : i? ^ S* be given by sending tj ^ for 
r < j < m. The image of min. poly.^/j^ under the composition 

R[x] — ^ 

is ch. poly.^ Similarly, the image of min. poly. e S'[a;] is ch. poly.f, ^/j? g 

The homomorphism ip extends naturally to a map A ^ R ^ A(E) S such that 
<j){'y) = e. We have 

= (j)(Q) = 0(min.poly...^(7)) = (/i(min. poly.^)0(7), 

which is the polynomial 0(min. poly.^) G S[x\ evaluated at 4'{'^) — s. Consequently, 
min. poly. divides (/)(min. poly.^/^) in L[x\. Since S* is a UFD, min. poly. di- 
vides 0(min. poly.^/^) in S[x] . Consequently, the image ch. poly.^^^ of min. poly.^ 
in F[x\ divides the image ch. poly.^, j^jp of (/)(min. poly.^^^). □ 

2. Matrices 

In this section, we observe that the characteristic polynomial as defined above 
agrees with the usual linear algebra notion of characteristic polynomial in the case 
where A = M„(F). 

Everyone knows the next lemma, but maybe not the clean proof: 

Lemma 2.1. Let T be a linear transformation 0/ an F-vector space of dimension 
n > 1. Then T satisfies a nonzero polynomial of degree < n. 

Proof. We sketch the nice proof from |Bu 73 j . Let w be a nonzero vector in the 
vector space V. The n -I- 1 vectors 

v,T{v),T\v),...,T^v) 

must be linearly dependent, so there is a polynomial .9(2;) G -^[2;] of degree < n 
such that g{T)v^O. 
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Set U :— kei- g{T). The linear transformations T and g{T) commute, so T{U) C 
U and T induces a linear transformation Ty/n on V/U. By induction, T\u sat- 
isfies a polynomial mu{x) of degree < dimC/ and Ty/u satisfies a polynomial 
'niv/u{x) of degree < dimF/C/. Then mv/u{T)V C U and T satisfies the poly- 
nomial mu{x) mv/ui^)- Moreover, 

deg{mu ■ mv/u) — deg(m[/) + Aeg{my/u) < AmiU + dim{V/U) = A\mV. □ 

Corollary 2.2. The characteristic polynomial (in our sense) of a matrix in Mn{F) 
has degree < n. 

Proof. Lemma 12.11 applies in particular to 7 G M„ (K) as a linear transformation 
of . Substitution gives the corollary. □ 

Fix an algebraic closure F of F. For a G Mn{F), we call X G F an eigenvalue of 
a if the kernel of (A/„ — a) is nonzero. Let Ux denote the corresponding generalized 
eigenspace, i.e., the set of vectors v F lying in the kernel of (A/„ — aY for some 
natural number r. The multiplicity m{X) of an eigenvalue A is dim^? U\. 

Theorem 2.3. For a G Mn{F), the characteristic polynomial (as defined in \l.!^) 
of a factors in F[x] as 

(2.4) {x - Ai)"(^^)(x - Aa)"^^^' ■ ■ ■ {x - Afe)"(^'=\ 

where Ai, . . . , A^ are the distinct eigenvalues of a. 

Proof. Let B be the subalgebra of Mn{F) consisting of upper triangular matri- 
ces and let a' be in B. The characteristic polynomial ch. poly.^, g jp of a! as an 
element of B was computed in Example 11.51 it is of the form H2.4|l . Bv 11.101 5. 
ch. poly.^/ Qjp divides the characteristic polynomial ch. poly.^/ m„(f)/f of a' as an 
element of Mn{F). Since both polynomials are monic and have degree n (hv 11.51 
and l2.2|l . the theorem holds for upper triangular matrices. 

Since the characteristic polynomial of the given matrix a is unchanged under 
scalar extension, we may assume that F is algebraically closed, i.e., where F = F. 
Here we need one somewhat sophisticated result from linear algebra: Since F is 
algebraically closed, a is similar to an upper triangular matrix a' "Ax 95" Thm. 6.2]. 
But the theorem holds for a' by the preceding paragraph. Since similarity changes 
neither the characteristic polynomial (|1.1UI 2') nor the eigenvalues, the theorem holds 
for a. □ 

In |Ax 95j ■ Axler develops many of the typical properties of matrices (e.g., 
the existence of eigenvalues and the decomposition with respect to generalized 
eigenspaces) over an algebraically closed field without use of the determinant. For 
example, in §5 of that paper he defines the characteristic polynomial to be exactly 
the product displayed in the theorem. Logically, one could insert the contents of 
this paper at that point in his. 

Corollary 2.5. For a £ M„{F), the minimal polynomial min. poly .^^ and the char- 
acteristic polynomial ch. poly.^ have the same irreducible factors in F[x]. 

Proof. Irreducible polynomials in F[x] are determined (up to a scalar factor) by 
their roots in an algebraic closure F. Thus we may assume that F is algebraically 
closed. 
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As in the proof of Th. 12.31 a is similar to an upper triangular matrix since F is 
algebraically closed. Since conjugation changes neither the characteristic nor the 
minimal polynomial, we may also assume that a is upper triangular. 

By Th. 12.31 every irreducible factor of the characteristic polynomial is of the 
form {x — \i) where \i is a diagonal entry in a, say = an. The (i,i)-entry of 
min. poly.^(a) = is 0, but it is also min. poly.^(Ai). Therefore, {x — Xi) divides 
min. poly.£(, and ch. poly.^ divides min.poly.^. 

Since min. poly.^^ divides ch.poly.^j by Cayley-Hamilton, the corollary is proved. 

□ 

Proposition 2.6. For a and a' in Mn{F), the following are true: 

(1) a is invertible if and only if Cnia) ^ 0. 

(2) (Jacobi formula) c„(a) = E<TGS„('^iS'^'^)'^i'j(i)"2<T(2) • ■ ■anfT(«)- 

(3) c„(aa') = Cn{a)cn(a'). 

Proof. (1): The matrix a is invertible if and only if the kernel of a is trivial, if 
and only if is not an eigenvalue of A. By Theorem 12 .31 this is true if and only if 
c„(a) ^ 0. 

We now follow jAx 951 §9]. Write d{a) for the right-hand side of (2). A straight- 
forward rearrangement of terms as in jB W 891 p. 179] shows that d{aa') — d{a)d{a'). 
Therefore, (2) implies (3). 

We now prove (2). Suppose first that a is upper-triangular. Then both sides of 
(2) are just the product of the diagonal entries of a, hence (2) holds in this case. 

Now consider the general case. Since c„ (a) and d{a) are unchanged if we enlarge 
our base field, we may assume that F is algebraically closed and hence that a is 
similar to an upper triangular matrix a', i.e., a' = bab^^ for some b G Mn{F). Then 

c„(a') = d{a) = d{bab-^) ^ d{b)d{ab-^) = d{ab-^)d{b) = d{ab-^b) = d{a). 

Since c„(a) = c„(a') bv ll.l()l 2. we have proved (2). □ 

Definition 2.7. For a an element of a finite-dimensional i^-algebra A, we define 
the trace of a to be 

irA/ria) ■= ci(a) 

and the determinant of a to be 

detA/_F(a) c„(a). 

If there is no danger of ambiguity, we write simply det^ or det instead of det^/F 
and similarly for the trace. 

By Theorem l2.3l the trace tiMniF) and determinant det m„{f) are the usual trace 
and determinant from linear algebra. 

3. Quaternions 

Example 3.1. Hamilton's quaternions — usually denoted by HI — are defined to 
be the ring constructed by taking the complex numbers C and adjoining an element 
j such that = —1, j commutes with real numbers, and ij — — j'i. Note that EI is 
an R-algebra but not a C-algebra, since C is not in the center of H. It has R-basis 
1, i, j, fc, where k = ij. A lot of interesting information about the quaternions can 
be found in [Numbersl Ch. 7]. 
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Set 4> to be the M-linear map H M2{C) defined by 

This extends to an isomorphism </> : H C M2(C) (as C-algebras). 

Every quaternion can be written as q — r + si + uj + vk = z + wj for some reai 
numbers r, s, u, v and complex numbers z = r + si and w — u + vi. We have 

Since the characteristic polynomial is unchanged when we enlarge our base field, 
we find that 

tm{q) = trM2(C) ('/'('?)) = z + z = 2r 

and 

detn(g) = detj\/2(c)(0(q)) — zz + ww = + + + v"^ . 

Example 3.2 (Matrices over the quaternions). Write Af2(IHI) for the set of 2-by-2 
matrices with entries in H. The obvious addition and multiplication make it into a 
16-dimensional R-algebra. 

In |Cay 1845| , Cayley defined a determinant 

Cdet: Af2(Il) by Cdet f 911922 - 921922 

\921 922 / 

He noted that his determinant has some unsavory properties, for example that 
Cdet r Yj = for aU 9, 9' e H, 



but 



Cdetl' \]^ij-ji = 2ij^{). 



\3 3, 

Let us constrast this with the determinant that we have just defined. Just as for 
above, there is an isomorphism 02 : Af2(Et) ® C ^ M4(C) such that 

^911 912A ^ f 0(911) <P{<112) 
^921 922/ V'?^(92i) 0(922)^ 

(Recall that 0(9) is a 2-by-2 complex matrices for every 9 e H.) If m e M2{S) has 
a repeated row or column, then so does 02 (m), hence 

detM2(H)("^) = detM4(C)(</'2(?™)) = 0. 

More generally, our trace and determinant have the nice properties of the usual 
trace and determinant for matrices as given in ll.lOl 

For a more comprehensive discussion of various types of determinants for M2 (H) , 
see [As 96| . Aslaksen refers to our detjv/2(H) as the Study determinant. 

Example 3.3 (Central simple algebras). A typical topic for a first year gradu- 
ate algebra course is Wedderburn's description of simple artinian rings: they are 
isomorphic to Mr{D) for D a skew field. Write F for the center of D (which is 



10 



R. SKIP GARIBALDI 



necessarily a field), and suppose that D is finite-dimensional over F. Such an alge- 
bra Mr{D) is called central simple. We have just seen two examples of these, with 
F = R, D = H, and r = 1, 2. 

The trace trjv^^(£)) and determinant det^/^^/)-) are called the reduced trace and 
reduced norm respectively. They are usually constructed by "Galois descent" as in 
|Dr 831 p. 145] or jPi 82| . but here we get them as a consequence of the existence of 
the characteristic polynomial, which is true for a much broader class of algebras. 

4. More properties of the characteristic polynomial 

Here we discuss the example of finite-dimensional field extensions (14.4(1 and prove 
some more nice properties of the characteristic polynomial ((4.5|l . 

Write EndF(^) for the set of i^-linear maps A ^ A. It is an F-algebra; its 
multiplication is function composition. It is isomorphic to Mm{F). 

For a e yl, write La for the element of Endi?(A) defined by 

La{b) = ah for b ^ A. 

The map a ^-^ La defines an i^-algebra homomorphism called the left regular rep- 
resentation of A. This homomorphism is injective: if La = 0, then La{a') — for 
all a' E A, hence = La{lA) = a ■ Ia = a. 

Example 4.1. Let A be M„(F) or more generally a simple ring with center F 
such that dimi? A = n^. One can show that 

(4-2) ch. poly.i^,E„dp(A)/F = (ch. poly. a^A/pT 

for every a € A, see e.g. |Bo48l Ch. VIII, §12.3]. 

Proposition 4.3. Let a be an element in a finite- dimensional F-algebra A. The 
minimal polynomial min. poly.^^^ divides the characteristic polynomial ch. poly.^, j^^p 
of a which divides the characteristic polynomial ch. poly.^^ EndF(A)/_F' '^^^ *^ 
All three polynomials have the same irreducible factors in F[x]. 

Proof. Since ch. poly.^(a) = by Cayley-Hamilton ((l.lQI l). the minimal polyno- 
mial of a divides the characteristic polynomial ch. poly.^. Since the left regular 
representation is injective, we have that ch. poly.^ divides ch. poly.^^^ bv ll.101 5. 

We are reduced to showing that ch. poly.^ and min. poly.^ have the same ir- 
reducible factors. Since the left regular representation is injective, a and La have 
the same minimal polynomials. That is, we need only show that ch. poly.^ and 
min. poly. ^ have the same irreducible factors. Since Endi?(^) is isomorphic to 
Mm(F) for m ~ dimp A, we are done bv l2.5l □ 

The proposition gives us the power to handle another example. 

Example 4.4 (Finite-degree field extensions). Let A be an extension field of F 
of finite dimension m. Every element a E A gives an element La in Endi?(A) = 
Mm (F) , and the characteristic polynomial of a divides the characteristic polynomial 
of La by the proposition. The trace and norm of a are typically defined to be the 
trace and determinant of La. 

If A is separable over F, then by the Theorem of the Primitive Element, A — F[d] 
for some 9 E A. The minimal polynomial of 9 has degree m. Since it divides the 
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characteristic polynomial of Le bv 14.31 and that polynomial has degree m, we find 
that 

ch. ^o\y. Q^A/F = ch. poly.ig . 
Since the characteristic polynomial has the same degree for all a g A, we have 

ch. poly. J, — ch. poly.^ for all a £ A. 

In particular, for finite separable field extensions, our trace and determinant agree 
with the usual trace and norm. 

If A is not separable over F, there can be some disagreement. For example, let 
F = F2(u, v) where F2 is the field with 2 elements and u, v are indeterminates. The 
field A — F{^Ju, ^Jv) is a purely inseparable extension of degree 4 with F-basis 1, 
\/u, ^Jv, ^Jmj. The generic element 

7 = • 1 + t2^/u + t^^/v + t4^/mj 

has minimal polynomial 

7^ - {tj + tju + tjv + tluv). 

Here the characteristic polynomial of each element a Cz A divides but is not equal 
to the characteristic polynomial of La- 

The proposition also allows us to prove that many nice properties of the charac- 
teristic polynomial of a matrix also hold for characteristic polynomials of elements 
of A. 

Recall (|1.4|l that the characteristic polynomial has the same degree for every 
a Cz A. We say that A has degree n if ch. poly.^ has degree n. 

Corollary 4.5. Let A be a finite- dimensional F-algebra of degree n. For a Cz A, 
we have: 

(1) Ci(l^) = ("). In particular, tiAi^A) ~ n and detyi(l^) = 1. 

(2) a is invertible if and only if det A{a) 7^ 0. 

(3) a is nilpotent if and only if ch. poly.^ — x"' . 

Proof. (1): The minimal polynomial of I a is x — 1, hence Proposition l4.3l gives that 

ch.poly.i^ 

(2, Define the adjoint of a, denoted by adj a, to be 

adj a = - ci(a)a"-2 + • • • + (-l)"-ic„_i(a)]. 

Then 

a • adja = (-l)"+i[ch. poly. Ja) - (-l)"c„(a)] ^ det^lajU. 
hence, if det A{a) is not zero, a is invertible. 

(2, If a is invertible, then La is invertible with inverse La-^. Bv 12.61 1. the 
constant term det-Endp (A) (La) of ch. poly.^^ is not zero. Since ch.poly. ^^ a/f ^^'^ 
ch. poly.^^ EndF{A)/F have the same irreducible factors in F[x], the constant term 
detA(a) of ch.poly. a A/F is not zero. 

(3): a is nilpotent if and only if it satisfies for some natural number r, if and 
only if it has minimal polynomial for some natural number p. Since the minimal 
polynomial and characteristic polynomial have the same irreducible factors this 
holds if and only if the characteristic polynomial of a is x". □ 
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Remark 4.6. One might be tempted to accept the traditional definition of charac- 
teristic polynomial for matrices, and then define the characteristic polynomial of 
a € A as ch. poly.^^ EndF{A)/F- (This is logically equivalent to the usual definition 
of the norm and trace in the case where A is a. finite-degree field extension of F, see 
e.g. |Lang 02} VI. 5. 6].) However, Example 14. II shows that in some cases one wants 
to take an n-th root of ch. poly.^^ . But there is a more serious problem: There 
is no strong mathematical reason to prefer the left regular representation over the 
right regular representation (defined in the obvious manner as a i— s- Ra), and the 
characteristic polynomials of La and Ra may differ. Adrian Wadsworth points out 
that the upper triangular matrices from Example 11.51 provide an example of this 
difficulty. In particular, the generic element 7 has 

ch. poly.ij7) = (a; - ti)"(a; - tz)"-^ ■ ■ ■ {x - tn-if{x - i„) 

and 

ch. poly.flj7) = {x- ti){x ~ ta)' ■ ■ ■ {x - tn-iT'^x - t„)". 

5. The product formula for determinants 

In this section, we prove that the usual product formula for determinants of 
matrices holds for an arbitrary finite-dimensional i^-algebra A: 

Theorem 5.1. For every a, a' £ A, we have 

det^ (aa') = det^(a) detA(a')- 

We postpone the proof until the end of the section. A somewhat different proof 
can be found in ' J acobson 681 §VI.5], but it is even more complicated and arcane 
than what you will find below. 

5.2. Recall that det^ is both a function A ^ F and an element of i? = F[ti, . . . , 
To evaluate the function detA on a G A, we write a in terms of our basis ai, . . ., 
a„i as a = cXiUi and substitute ti l—^ ai in the polynomial detA- In this manner, 
we may view each polynomial in i? as a function A —^ F. 

Lemma 5.3. Suppose that F is infinite. Let f £ R be such that 

/(1a) = 1 and f{aa') ~ f{a) f{a') for all a, a' £ A. 

If g £ R divides f and has g(lyi) = 1, then 

g{aa') — g{a)g{a') for all a, a' £ A. 

Proof. Since /(1a) — g{^A) — 1, we have f,g ^ in R. Suppose that / or 5 is a 
unit in R, i.e., is in R* — F*. Then 5 is a unit, hence g — 1 and the lemma holds. 

We may assume that / and g are nonzero nonunits. Since i? is a UFD, we 
may write / = /1/2 • • • /r where fk is irreducible in R for every fc = 1, . . ., r. By 
multiplying fk by an element of F* if necessary, we may assume that fki^A) ~ 1 
for every k. Since g divides / and g(lyi) = 1, 5 is a product 

g = fe for some L C [l,n]. 

eeL 

To prove the lemma, it suffices to prove that fk{aa') — fk(0')fk{a') for all k £ [1, n] 
and a, a' £ A. 
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Set R' — F[ui, . . . , Um, vi, . . . , Vm] for ui, . . . , Um, wi, . . . , independent inde- 
terminates, and let F' be the quotient field of R' . Set 

fi = UiUi and v = 'fjO'j in A (g) i?'. 

i 3 

As in 15.21 every element of R defines a map A® R' ^ R' . For example, to find 
/(/i), one takes / and substitutes ti ^ Ui. Similarly, one gets f(y) by substituting 

tj I—!- Vj. 

We claim that f{y.v) = f{n)f{v). Let 5 := /(/xi^) - f{p)f{v) in i?'. Every 
element of R' defines a function F^™ — > by plugging in for the the u's and d's. 
Substituting ui i— s- and Vj i— s- /3j into (5 for a^, /3j e _F, we obtain 

f{ab) — f{a),f{b) for a ~ a^Oi and b = ) 

which is by hypothesis. That is, 5 gives the map F^™ — > F which is identically 
0. Since F is infinite, i5 = 0, which proves the claim. 
Thus 

r r 

n = /O^^) = = n in R'. 

k=l k=l 

Recall that one obtains /fe(/i) and fk{v) by substituting one set of indeterminates 
for another in fk- Hence, since fk is irreducible in i?, the polynomials fkilj), fk{v) 
are irreducible in R' for all k. Therefore the prime factorization of fkifiv) in the 
UFD R' is a product of and /q(t^) for some p's and g's. Substituting in for 

the Wj's so that v is sent to 1^ maps 

/fe(/iz/) i-^ fk{^J■), /p(Ai) /p(/^), and fq{iy) h-^ /^(Ia) = 1 

for all p and g. Hence the only irreducible factor of fk{^^v) amongst the fp{fi) terms 
is equal to /fe(/i). Similarly, substituting in for the u^'s so that i-^- 1^^, we have 
fkifii^) /fe(i^), hence 

fkU^i^) = fk{^J^)fkW) for A; = 1, ... , n. 

By substituting /i h- s- a and i-^ a', we obtain that fk{aa') = fk{0')fk{oL') for all ctj 
a' e A. □ 

Proof of Theorem \5.U Since the determinant of an element of A is unchanged when 
we enlarge the base field (|1.8|) . we may assume that F is infinite. Let 7 = '^tiOi 
be a generic element of ^ as in and consider the element Lj in End^f {A (g) K) . 

The minimal polynomial min. poly...^ divides ch. poly.^,^ gndjc K[x\ by 
14.31 By 11.21 both polynomials actually lie in R[x\ for R = F[ti, . . . ,tm], hence 
min. poly. ^ divides ch. poly.^_^ in R[x\ by Gauss' Lemma. Substituting a; 1-^ 
defines a surjection R[x\ — > R which sends min. poly. ^ ^ (— l)'^detA for d = 
deg(min. poly.^) and ch. poly.^_^ to the function a i— s- (— l)"MetEndF(A) (-^a)- Since 
substitution is a ring homomorphism, det^i divides a ^ detEndj!.(A)(-^a) in R. 

We have 

detEnd^ (A) (-^aa') = dctEndy (A) (-^a-^a' ) — dctEnd^ (A) (-^a) dctEnd^ (A) (-^a' ) J 

where the last equality is bv l2.6l 3 since Endi?(A) ^ Mm{F). Note that 

detEndj7(A)(-^lA) = detM„(_F)(lM„(F)) = 1 
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and detA(lA) — 1 bv 14.51 1. Since F is infinite, Lemma [5.31 gives : 

detA(aa') = detA(a) detyi(a') 
as desired. □ 

To summarize, we defined the cliaracteristic polynomial of an element in a finite- 
dimensional i^-algebra in We defined the determinant det^ to be the constant 
term of this polynomial (|2.7() . In the case A = Mn{F), we found that detj^f^^^p-f is 
given by the Jacobi formula (|2.6I 2). hence the product formula holds for det^^i^p-f 
(12.61 3). Finally, we used Prop. H31 to prove the product formula for detyi (|5.1|l . 

6. Miscellaneous remarks 

This section is a survey of other results. It is necessarily briefer and more tech- 
nical than the rest of the paper. 

6.1. The "usual" definition of the characteristic polynomial. For a E A, I 

claim that the formula 

(6.2) ch. poly.^ = det(a; • lyi — a) 

holds in F[x]. But what does the expression "det(x • 1a — a)" mean? For matrices, 
the determinant is given by the Jacobi formula 12.61 2 involving the entries of a ma- 
trix; this formula makes sense whether the entries are elements of F or polynomials 
in F[x]. The same reasoning holds for our more general notion of determinant. 

One way to prove H6.2|l is as follows. First prove it for A the algebra of upper 
triangular matrices from Example ll.51 Then prove (|6.2I) for A = Mn{F) by reducing 
to the upper triangular case as in the proof of Th. 12.31 This implies (16.21) for general 
A by the arguments in | Jacobson 681 p. 225]. 

6.3. ch.poly.jjj — ch.poly.;,^^. If a and b are n-hy-n matrices, it is well-known that 
ch. poly.jjf, = ch. poly.f,(j. We can prove it directly for general A. Let a be one of 
the coefficients of the characteristic polynomial as in H1.9(l . so Ci: A ^ F and we 
want to show that Ci{ab) = Ci{ba) for all i. 

First assume that b is invertible. Then 

Ci{ab) = Cj(6(a6)6"^) = Ci{ba) 

bv [TTni 2. 

Now the set U of pairs {a,b) E A x A such that b is invertible is a nonempty 
open subset oi A x A in the Zariski topology. The map A x A F defined by 

(a, b) i~> Ci{ab) — Ci{ba) 

is given by a polynomial in the coordinates of a and b and is zero on U, hence is 
identically zero on Ax A. This proves that Ci{ab) — Ci{ba) for all a,b E A and all i. 

6.4. The Pfafiian. Suppose that charF ^ 2. Write Skew„(i^) for the vector space 
of skew- symmetric n-hy-n matrices, i.e., matrices a such that a* = —a. 

If n is odd, then det(-a) = -det(a) bv lLTUl S. Since det(a*) = det(a) bv ll.101 2. 
all n-hy-n skew-symmetric matrices have determinant 0. 
Suppose that n is even. There is a polynomial map 

(6.5) Pf : Skew„(i^) such that (Pf (a))^ = det m„(f) (a) 

called the Pfaffian, see e.g. |Lang 02| §XV.9]. It seems a bit mysterious! 
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(Note that Equation 1)6.5(1 only determines the Pfafhan up to sign; classicaUy one 
chooses an invertible skew-symmetric matrix S with det(5) = 1 and fixes the sign 
of Pf so that Pf(S') = 1.) 

In fact, the Pfaffian exists as a consequence of the characteristic polynomial as 
defined in ^ Fix a matrix S as in the preceding paragraph and define a multipli- 
cation • on Skew„(_F) given by 

a-b=^{aS-^b + bS-^a). 

This makes Skew„(_F) into a Jordan i^-algebra with identity element S. These 
algebras arise naturally in the classification of central simple Jordan algebras^; 
they are associated with a nondegenerate skew-symmetric bilinear form on F". 

As mentioned in the introduction, the theory developed in ^ can be extended 
to give a characteristic polynomial and a determinant for this Jordan algebra. One 
finds that the determinant is the Pfaffian: 

detskcw„(i^)(a) = Pf(a), 
see I Jacobson 681 pp. 230-232]. 

6.6. Matrices over rings. Apparently, we have given a canonical construction 
of an analogue of the characteristic polynomial and hence the determinant for any 
algebra over a field F. But the characteristic polynomial and determinant are 
typically defined for M„(i?) where R is merely a commutative ring with 1; our 
definition also works in that case with some modification. 

First, consider Af„(Z) as a Z-algebra, i.e., the case A — M„(Z) and F = Z. Since 
Z is a noetherian UFD and Af„ (Z) is a free Z-module of finite rank, the results in 
32 hold with this A and F. 

Lemma 11.81 holds if we require £' to be a noetherian UFD containing F so that 
we may define the characteristic polynomial over E. (Since A is a free -F-module, 
A(E)E is a free £'-module.) Let L be the field of rational functions in ti, . . ., tm over 
the quotient field of E. The minimal polynomial for 7 over K is also the minimal 
polynomial for 7 over L bv ll.8l hence characteristic polynomials are also unchanged 
by scalar extension in this more general setting. In particular, the characteristic 
polynomial of a matrix a G M„ (Z) is the same as the characteristic polynomial of 
a considered as a matrix in M„ (Q) , which is the usual characteristic polynomial by 
Th.Ol 

A coefficient Ci of the characteristic polynomial as in 1(1.9(1 is given by a poly- 
nomial in Z[ii, . . . ,tm\, and it is evaluated on a matrix a as described in 15.21 For 
example, undergraduates are taught how to evaluate c„ = det, which is given by 
the Jacobi formula 12.61 2 with respect to the standard basis of M„(Z). Just as for 
the determinant, one can evaluate Ci on a matrix in M„(Z) (E) R = Mn{R) for R 
any commutative ring with 1, and the resulting characteristic polynomial has the 
usual felicitous properties. 

References 

[AI6I] A.A. Albert, Structure of algebras, AMS Coll. Pub., vol. 24, AMS, Providence, RI, 

1961, revised printing. 



^See IJacobson 681 §V.7]. They correspond to Lie algebras of type C in the Killing-Cartan 
classification. 



16 



R. SKIP GARIBALDI 



[AM 69] M.F. Atiyah and I.G. MacDonald, Introduction to commutative algebra, Addison- 

Wcslcy, Reading, MA, 1969. 
[As 96] H. Aslaksen, Quaternionic determinants, Math. Intelligencer 18 (1996), 57-65. 

[Ax 95] S. Axler, Down with determinants!, Amer. Math. Monthly 102 (1995), no. 2, 139- 

154. 

[Ax 97] , Linear algebra done right, second ed. , Undergraduate Texts in Mathemat- 
ics, Springer, 1997. 
[Bo 48] N. Bourbaki, Algebre, Hermann, Paris, 1948. 

[Br 01] O. Bretscher, Linear algebra with applications, second ed.. Prentice Hall, 2001. 

[BW89] J.G. Broida and S.G. Williamson, A comprehensive introduction to linear algebra, 

Addison-Wcslcy, 1989. 

[Bu73] M.D. Burrow, The minimal polynomial of a linear transformation, Amcr. Math. 

Monthly 80 (1973), 1129-1131. 
[Cay 1845] A. Cayley, On certain results relating to quaternions. Philosophical Magazine 26 

(1845), 141-145 (= Mathematical Papers, vol. I, 123-126). 
[De48] M. Deuring, Algebren, Chelsea, 1948. 

[Die 38] L.E. Dickson, Algebras and their arithmetics, G.E. Stechert, New York, 1938. 

[Die 57] J. Dieudonne, Sur le polynome principal d'une algebre. Arch. Math. 8 (1957), 81-84. 

[Dr83] P.K. Draxl, Skew fields, London Math. Soc. Lecture Note Series, vol. 81, Cambridge 

University Press, Cambridge- New York, 1983. 
[Ha 96] J. Hannah, A geometric approach to determinants, Amer. Math. Monthly 103 

(1996), no. 5, 401-409. 

[He 75] I.N. Herstein, Topics in algebra, second ed., Xerox College Publishing, Lexington, 

Mass., 1975. 

[Hu80] T. Hungerford, Algebra, second ed.. Springer, 1980. 

[Jacobi 1841] C.G.J. Jacobi, De formatione et proprietatibus determinantium, J. Reine Angew. 

Math. 22 (1841), 285-318, (= Ges. Werke, vol. HI, pp. 355-392). 
[Jacobson43] N. Jacobson, The theory of rings. Mathematical Surveys, vol. 2, American Mathe- 
matical Society, New York, 1943. 
[Jacobson 63] , Generic norm of an algebra, Osaka Math. J. 15 (1963), 23—50, (= Coll. 

Math. Papers, vol. 2, pp. 489-515). 
[Jacobson 68] , Structure and representations of Jordan algebras, AMS Coll. Pub., vol. 39, 

AMS, Providence, RI, 1968. 
[Jacobson 75] , Pl-algebras: an introduction. Lecture Notes in Mathematics, vol. 441, 

Springer- Verlag, Berlin, 1975. 
[Kn94] E. Knobloch, Prom Gaufi to Weierstrafi: determinant theory and its historical 

evaluation. The intersection of history and mathematics. Science Networks, vol. 15, 

Birkhauser, 1994, pp. 51-66. 
[Lang02] S. Lang, Algebra, revised third ed.. Graduate Texts in Mathematics, vol. 211, 

Springer, 2002. 

[Lax97] P.D. Lax, Linear algebra, John Wiley & Sons Inc., New York, 1997. 

[Numbers] H.-D. Ebbinghaus, H. Hermes, F. Hirzebruch, M. Koecher, K. Mainzer, J. Neukirch, 

A. Prestel, R. Remmert, Numbers, Springer, 1991. 
[Pi 82] R.S. Pierce, Associative algebras, Graduate Texts in Mathematics, vol. 88, Springer, 

1982. 

[Sc 1891] G. Scheffers, Zuruckfuhrung complexer Zahlensysteme auf typische Formen, Math. 

Annalen 39 (1891), 293-390. 
[Study 1889] E. Study, Uber Systeme von complexen Zahlen, Nachrichten von der Koniglichen 

Gesellschaft der Wissenschaften zu Gottingen (1889), no. 9, 237-268. 
[Weber] H. Weber, Lehrbuch der Algebra, third ed., vol. 1, Chelsea, 1961? 

[Wei] K. Weierstrass, Zur Determinantentheorie, note based on lectures given in 1886/87. 

Appeared posthumously in Mathematische Werke, (J. Knoblauch, ed.), vol. Ill, 

1903, pp. 271-286. 

Dept. of Mathematics and Computer Science, Emory University, Atlanta, GA 30322, 
USA 

E-mail address: skipamember.ams.org 
URL: http : //www . mathcs . emory . edu/~skip 



